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Untitled Sequence # 2 *> 1 -phase Translation 

EWA fiequence 1983 b.p. TTCCATCTAACC . . . ATGGTGGGCGCC linear 

i/i 31/11 

TTC CAT CTA ACC ACA OCT AAC GGA GAA CCA CAC ATG ATC GTC AGT AGA CAA GAG AAA GGG 

FHLTTRNGEPHMIVSRQEKG 
61/21 91/31 

AAA AGT CTT TTG TTT AAA ACA GAG GAT GGC GTG AAC ATC TGC ACC CTC ATG GCC ATG GAC 

KSLLPKTEDGVNMCTLMAMD 
121/41 151/51 

CTT GOT GAA TTG TCT GAA GAC ACA ATC ACG TAC AAG TGT CCC CTT CTC AGG CAG AAT GAG 

IiGELCEDTlTYKCPLLRQNE 
181/61 2n/71 

CCA GAA GAC ATA GAC TGC TGG TQC AAC TCC ACG TCC ACG TQG OTA ACC TAT GGG ACT TCT 

PEDIDCWCNSTSTWVTYGTC 
241/81 271/91 

ACC ACC ACG GGA GAA CAT AGA AGA GAA AAA AGA TCA GTG GCA CTC GTT CCA CAT OTG GGA 

TTTGEHRREKRSVALVPHVG 
301/101 331/111 

ATC OCA CTC GAC ACC CGA ACT GAA ACA TGG ATG TCA TCA GAA GGG GCT TOG. AAA CAT GCC 

MGLETRTETWMSS-EGAWKHA 
361/121 391/131 

CAG AGA ATT GAA ATT TGG ATC CTG AGA CAT CCA GGC TTC ACC ATA ATG GCA GCA ATC CTC 

QRIEIW1LRHPGFTIMAAIL 
421/141 451/151 

GCA TAC ACC ATA GGG ACG ACA CAT TTC CAG AGA GCA CTG ATT TTC ATC OTA CTC ACA GCT 
AYTIGTTHFQRAL IPILLTA 
481/161 511/171 

GTC GCT OCT TCA ATG ACA ATG COT TGC ATA GGA ATA TCA AAT AGA GAC TTT CTA GAA GGG 

VAPSMTM. RCXGISHRDFVBG 
541/181 571/191 

CTT TCA GGA GGA AGC TGG GIT GAC ATA GTC T2!A GAA CAT GGA AGC TGT GTG ACG ACG ATG 

VSGGSWVDIVLEHGSCVTTM 
601/201 631/211 

GCA AAA AAC AAA CCA ACA TTG GAT TIT GAA CTG ATA AAA ACA GAA GCC AAA CAG OCT GCC 

AKNKPTLDFELIKTBAKQPA 
661/221 691/231 

ACC CTA AGG AAG TAC TOT ATA GAG GCA AAG CTA ADC AAC ACA ACA ACA GAA TCT CGT TGC 

TLRXYCIEAKLTNTTTESRC 
721/241 751/251 

CCA ACA CAA GGG GAA COC AGC CTA AAT GAA GAG CAG GAT AAA AGG TTC GTC TGC AAA CAC 

PTQGBPSLNEBQDKRFVCKH 
781/261 811/271 

TCC ATG OTA GAC AGA GGA TGG GGA AAT GGA TGT GGA OTA TTT GGA AAG GGA GGC ATT GTG 

SMVDRGWGNGCGLFGKGCIV 
B41/281 871/291 

ACC TOT GCT ATG TTC ACA TGC AAA AAG AAC ATG GAG GGA AAA GIT GTG CAG CCA GAA AAC 
TCAMFTCKKNMEGK V" V Q P B N 
901/301 931/311 

TTG GAA TAC ACC ATT GTG CTA ACA CCC CAC TCA GGG GAA GAG CAT GCG GTC GGA AAT GAC 
liBYTI VVTPHSGE BHAVGND 
961/321 991/331 

ACA GGA AAA CAT GGC AAG GAA ATC AAA OTA ACA CCA CAG ACT TCC ATC ACA GAA GCA GAA 
TGKHG KE IKVTPQ S S I TEAE 
1021/341 1051/351 * 

TTG ACA GST TAT GGC ACT GTC ACG ATG GAG TGC TCT CCG AGA ACA GGC CTC GAC TTC AAT 
LTGYGTVTMECSP RTGLDF'N 
1081/361 1111/371 

GAG ATG GTG TTG CTG CAG ATG GAA AAT AAA GCT TGG CTG GTC CAT AGG CAA TGG TTC CTA 

EMVLLQMENKAWLVHRQWFL 
U41/3B1 1171/391 

GAC CTG CCG TTA CCA TGG CTG CCC GGA GCG GAC ACA CAA GGG TCA AAT TGG ATA CAA AAA 
DLPLPWLPGADTCG SNWIQK 
1201/401 1231/411 

GAA ACA TTG GTC ACT TTC AAA AAT CCT CAT GCG AAG AAA CAG GAT GTT GIT GTT TTA GGA 
BTLVTFKNPHAKK Q D V V V X* G 
1261/421 1291/431 

TCC CAA GAA GGG CCC ATG CAC ACA GCA CTC ACA GGC CCC ACA GAA ATC CAA ATG TCA TCA 
SQEGAHHTALTGA ? E IQMSS 
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AGTAAATCCTGTGTGCTAATTGAGGTGCATTGGTCTGCAA 4 0 
ATCGAGTTGCTAGGCAATAAACACATTTGGATTAATTTTA 8 0 
ATCGTTCGTTGAGCGATTAGCAGAGAACTGACCAGAACAT 120 
GTCTGGTCGTAAAGCTCAGGGAAAAACCCTGGGCGTCAAT 1 60 
ATGGTACGACGAGGAGTTCGCTCCTTGTCAAACAAAATAA 200 
210 220 230 240 



■ ' ' 



AACAAAAAACAAAACAAATTGGAAACAGACCTGGACCTTC 240 
AAGAGGTGTTCAAGGATTTATCTTTTTCTTTTTGTTCAAC 280 
ATTTTGACTGGAAAAAAGATCACAGCCCACCTAAAGAGGT 320 
TGTGGAAAATGCTGGACCCAAGACAAGGCTTGGCTGTTCT 360 
AAGGAAAGTCAAGAGAGTGGTGGCCAGTTTGATGAGAGGA 4 00 



410 
■ ■ ■ * ■ ■ ■ ■ I 



420 

* 1 1 ■ • ■ i I 



430 440 
1 ■ I ■ ■ ■ ■ I ■ i i i | 



TTGTCCTCAAGGAAACGCCGTTCCCATGATGTTCTGACTG 4 4 0 
TGCAATTCCTAATTTTGGGAATGCTGTTGATGACGGGTGG 4 8 0 
AATGAAGTTGTCGAATTTCCAGGGGAAGCTTTTGATGACC 520 
ATCAACAACACGGACATTGCAGACGTTATCGTGATTCCCA 560 
CCTCAAAAGGAGAGAACAGATGTTGGGTTCGGGCAATCGA 600 
610 620 



630 



640 



CGTCGGCTACATGTGTGAGGACACTATCACGTACGAATGT 64 0 
CCTAAGCTTACCATGGGCAATGATCCAGAGGATGTGGATT 68 0 
GCTGGTGTGACAACCAAGAAGTCTACGTCCAATATGGACG 720 
GTGCACGCGGACCAGGCATTCCAAGCGAAGCAGGAGATCC 7 60 
GTGTCGGTCCAAACACATGGGGAGAGTTCACTAGTGAATA 800 



810 



820 



830 



840 



AAAAAGAGGCTTGGCTGGATTCAACGAAAGCCACACGATA 8 4 0 
TCTCATGAAAACTGAC-AACTGGATCATAAGGAATCCTGGC 8 8 0 
TATGCTTTCCTGGCGGCGGTACTTGGCTGGATGCTTGGCA 92 0 
GTAACAACGGTCAACGCGTGGTATTTACCATCCTCCTGCT 960 
GTTGGTCGCTCCGGCTTACAGTTTTAATTGTCTGGGAATG 1000 
1010 1020 



1030 



1040 



GGCAATCGTGACTTCAT AGAAGGAGCCAGTGGGGCCACTT 1040 
GGGTGGACTTGGTGCTAGAAGGAGACAGCTGCTTGACAAT 1080 
CATGGCAAACGACAAACCAACATTGGACGTCCGCATGA^T 1120 
AACATCGAAGCTAGCCAACTTGCTGAGGTCAGAAGTTACT 1 1 60 
GCTATCATGCTTCAGTCACTGACATCTCGACGGTGGCTCG 1200 



1210 



1220 



1230 



1240 



GTGCCCCACGACTGGAGAAGCCCACAACGAGAAGCGAGCT 124 0 
GATAGTAGCTATGTGTGCAAACAAGGCTTCACTGACCGTG 1280 
GGTGGGGCAACGGATGTGGATTTTTCGGGAAGGGAAGCAT 1320 
TGACACATGTGCAAAATTCTCCTGCACCAGTAAAGCGATT 1360 
GGG AGAACAATCC AGCCAGAAAAC ATCAAAT AC AAAGTTG 14 00 
1410 1420 1430 1440 

GCATTTTTGTGCATGGAACCACCACTTCGGAAAACCATGG 14 4 0 
GAATTATTCAGCGCAAGTTGGGGCGTCCCAGGCGGCAAAG 14 80 
TTTACAGTAACACCCAATGCTCCTTCGGTAGCCCTCAAAC 1520 
TTGGTGACTACGGAGAAGTCACACTGGACTGTGAGCCAAG 1560 
GAGTGGACTGAACACTGAAGCGTTTTACGTCATGACCGTG 1600 
1610 1620 1630 1640 

GGGTCAAAGTCATTTCTGGTCCATAGGGAGTGGTTTCATG 1640 
ACCTCGCTCTCCCCTGGACGTCCCCTTCGAGCACAGCGTG 1680 
GAGAAACAGAGAACTCCTCATGGAATTTGAAGGGGCGCAC 1720 
GCCACAAAACAGTCCGTTGTTGCTCTTGGGTCACAGGAAG 17 60 
GAGGCCTCCATCATGCGTTGGC AGGAGCCATCGTGGTGGA 1800 



1810 
1 ■ 1 1 I ■ ■ ■ • 1 • ■ ■ 



1820 1830 1840 

1 1 ■' 1 ■ ■ ■ ■ 1 1 .... I 



GTACTCAAGCTCAGTGATGTTAACATCAGGCCACCTGAAA 1840 
TGTAGGCTGAAAATGGACAAACTGGCTCTGAAAGGCACAA 1880 
CCTATGGCATGTGTACAGAAAAATTCTCGTTCGCGAAAAA 1920 
TCCGGTGGACACTGGTCACGGAACAGTTGTCATTGAACTC 1960 
TCCTACTCTGGGAGTGATGGCCCCTGCAAAATTCCGATTG 2000 



2010 2020 
-* ' ■ 1 1 1 • ■ • 1 • ■ • ■ I ■ ■ ■ ■ I • 



2030 



2040 



TTTCCGTTGCGAGCCTCAATGACATGACCCCCGTTGGGCG 2040 
GCTGGTGACAGTGAACCCCTTCGTCGCGACTTCCAGTGCC 2080 
AACTCAAAGGTGCTGGTCGAGATGGAACCCCCCTTCGGAG 2120 
ACTCCTACATCGTAGTTGGAAGGGGAGACAAGCAGATCAA 2160 
CCACCATTGGCACAAAGCTGGAAGCACGCTGGGCAAGGCC 2200 
2210 2220 2230 2240 

TTTTCAACAACTTTGAAGGGAGCTCAAAGACTGGCAGCGT 2240 
TGGGCGACACAGCCTGGGACTTTGGCTCTATTGGAGGGGT 2280 
CTTCAACTCCATAGGAAGAGCCGTTC ACCAAGTGTTTGGT 2320 
GGTGCCTTCAGAACACTCTTTGGGGGAATGTCTTGGATCA 23 60 
CACAAGGGCTAATGGGTGCCCTACTGCTCTGGATGGGCGT 24 00 



2410 



2420 



2430 



2440 



CAACGCACGAGACCGATCAATTGCTTTGGCCTTCTTAGCC 24 40 
ACAGGAGGTGTGCTCGTGTTCTTAGCGACCAATGTGGGCG 24 80 
CCGATCAAGGATGCGCCATCAACTTTGGCAAGAGAGAGCT 2520 
CAAGTGCGGAGATGGTATCTTCATATTTAGAGACTCTGAT 2560 
GACTGGCTGAACAAGTACTCATACTATCCAGAAGATCCTG 2 600 



2610 



2620 



2630 



2640 



TGAAGCTTGCATCAATAGTGAAAGCCTCTTTTGAAGAAGG 264 0 
GAAGTGTGGCCTAAATTCAGTTGACTCCCTTGAGCATGAG 2680 
ATGTGGAGAAGC AGGGCAGATGAGATCAATGCCATTTTTG 2720 
AGGAAAACGAGGTGGACATTTCTGTTGTCGTGCAGGATCC 27 60 
AAAGAATGTTTACC AG AGAGGAACTCATCCATTTTCC AG A 28 00 
2810 2820 2830 2840 

ATTCGGGATGGTCTGC AGTATGGTTGGAAGACTTGGGGT A 284 0 
AGAACCTTGTGTTCTCCCCAGGGAGGAAGAATGGAAGCTT 28 80 
CATCATAGATGGAAAGTCCAGGAAAGAATGCCCGTTTTC A 2920 
AACCGGGTCTGGAATTCTTTCCAGATAGAGGAGTTTGGGA 2 960 
CGGGAGTGTTCACCACACGCGTGTACATGGACGCAGTCTT 3000 
3010 3020 3030 3040 

TGAATACACCATAGACTGCGATGGATCTATCTTGGGTGCA 304 0 
GCGGTGAACGGAAAAAAGAGTGCCCATGGCTCTCC AACAT 3080 
TTTGGATGGGAAGTCATGAAGTAAATGGGACATGGATGAT 3120 
CCACACCTTGGAGGCATTAGATTACAAGGAGTGTGAGTGG 3160 
CCACTGACACATACGATTGGAACATCAGTTGAAGAGAGTG 3200 
3210 3220 3230 3240 

AAATGTTCATGCCGAGATCAATCGGAGGCCCAGTTAGCTC 3240 
TCACAATCATATCCCTGGATACAAGGTTCAGACGAACGGA 3280 
CCTTGGATGCAGGTACCACTAGAAGTGAAGAGAGAAGCTT 3320 
GCCCAGGGACTAGCGTGATCATTGATGGCAACTGTGATGG 3360 
ACGGGGAAAATC AACCAGATCCACCACGGATAGCGGGAAA 3400 
3410 3420 3430 3440 

GTTATTCCTGAATGGTGTTGCCGCTCCTGCACAATGCCGC 344 0 
CTGTGAGCTTCC ATGGTAGTGATGGGTGTTGGTATCCCAT 34 8 0 
GGAAATTAGGCCAAGGAAAACGCATGAAAGCCATCTGGTG 352 0 
CGCTCCTGGGTTACAGCTGGAGAAATACATGCTGTCCCTT 3560 
TTGGTTTGGTGAGCATGATGATAGCAATGGAAGTGGTCCT 3600 
3610 3620 



3630 



3640 



AAGGAAAAGACAGGGACCAAAGCAAATGTTGGTTGGAGGA 364 0 

GTAGTGCTCTTGGGAGCAATGCTGGTCGGGCAAGTAACTC 368 0 

TCCTTGATTTGCTGAAACTCACAGTGGCTGTGGGATTGCA 372 0 

TTTCCATGAGATGAACAATGGAGGAGACGCCATGTATATG 37 60 

GCGTTGATTGCTGCCTTTTCAATCAGACCAGGGCTGCTCA 3800 



3810 3820 3830 

1 1 * ' 1 ' 1 * 1 1 1 1 1 ■ • • ■ 1 • ■ • ■ I ■ • 



3840 



TCGGCTTTGGGCTCAGGACCCTATGGAGCCCTCGGG AACG 384 0 
CCTTGTGCTG ACCCTAGG AGCAGCC ATGGTGGAGATTGCC 3880 
TTGGGTGGCGTGATGGGCGGCCTGTGGAAGTATCTAAATG 3 920 
CAGTTTCTCTCTGCATCCTGACAATAAATGCTGTTGCTTC 3960 
TAGGAAAGCATCAAATACCATCTTGCCCCTCATGGCTCTG 4 000 
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4010 



4020 



4030 



4040 



TTGACACCTGTC ACTATGGCTGAGGTGAGACTTGCCGC AA 4 04 0 
TGTTCTTTTGTGCCATGGTTATCATAGGGGTCCTTCACCA 4 080 
GAATTTCAAGGACACCTCCATGCAGAAGACTATACCTCTG 4120 
GTGGCCCTCACACTCACATCTTACCTGGGCTTGACACAAC 4 1 60 
CTTTTTTGGGCCTGTGTGCATTTCTGGCAACCCGCATATT 4 200 



4210 



4220 



4230 



4240 



TGGGCGAAGGAGTATCCCAGTGAATGAGGCACTCGCAGCA 4 240 
GCTGGTCTAGTGGGAGTGCTGGCAGGACTGGCTTTTCAGG 4280 
AGATGGAGAACTTCCTTGGTCCGATTGCAGTTGGAGGACT 4 320 
CCTGATGATGCTGGTT AGCGTGGCTGGGAGGGTGGATGGG 4 360 
CTAGAGCTCAAGAAGCTTGGTGAAGTTTC ATGGGAAGAGG 4 4 00 



4410 



4420 



4430 



4440 



AGGCGGAGATCAGCGGGAGTTCCGCCCGCTATGATGTGGC 4 4 40 
ACTCAGTGAACAAGGGGAGTTCAAGCTGCTTTCTGAAGAG 4 4 80 
AAAGTGCCATGGGACCAGGTTGTGATGACCTCGCTGGCCT 4 520 
TGGTTGGGGCTGCCCTCCATCCATTTGCTCTTCTGCTGGT 4560 
CCTTGCTGGGTGGCTGTTTCATGTCAGGGGAGCTAGGAGA 4 600 
4610 4620 • 4630 4640 

AGTGGGGATGTCTTGTGGGATATTCCCACTCCTAAGATCA 4 64 0 
TCGAGGAATGTGAACATCTGGAGGATGGGATTTATGGCAT 4 680 
ATTCCAGTCAACCTTCTTGGGGGCCTCCCAGCGAGGAGTG 4720 
GGAGTGGCACAGGGAGGGGTGTTCCACACAATGTGGCATG 4760 
TCACAAGAGGAGCTTTCCTTGTCAGGAATGGCAAGAAGTT 4800 



4810 



4820 



4830 



4840 



GATTCCATCTTGGGCTTCAGTAAAGGAAGACCTTGTCGCC 4 840 
TATGGTGGCTCATGGAAGTTGGAAGGCAGATGGGATGGAG 4880 
AGGAAGAGGTCCAGTTGATCGCGGCTGTTCCAGGAAAGAA 4 920 
CGTGGTCAACGTCCAGACAAAACCGAGCTTGTTCAAAGTG 4 960 
AGGAATGGGGGAGAAATCGGGGCTGTCGCTCTTGACTATC 5000 



5010 



5020 



5030 



5040 



CGAGTGGCACTTCAGGATCTCCTATTGTTAACAGGAACGG 5040 
AGAGGTGATTGGGCTGTACGGCAATGGCATCCTTGTCGGT 5080 
G ACAACTCCTTCGTGTCCGCCATATCCCAGACTGAGGTGA 5120 
AGGAAGAAGGAAAGGAGGAGCTCCAAGAGATCCCGACAAT 5160 
GCTAAAGAAAGGAATGACAACTGTCCTTGATTTTCATCCT 5200 
5210 5220 



5230 



5240 



GGAGCTGGGAAGACAAGACGTTTCCTCCCACAGATCTTGG 52 4 0 
CCGAGTGCGCACGGAGACGCTTGCGCACTCTTGTGTTGGC 52 80 
CCCCACCAGGGTTGTTCTTTCTGAAATGAAGGAGGCTTTT 5320 
CACGGCCTGGACGTGAAATTCCACACACAGGCTTTTTCCG 5360 
CTCACGGCAGCGGGAGAGAAGTCATTGATGCCATGTGCCA 54 00 



5410 
* i i i I i i i i I i 



5420 5430 5440 



TGCCACCCTAACTTACAGGATGTTGGAACCAACTAGGGTT 54 4 0 
GTTAACTGGGAAGTGATCATTATGGATGAAGCCCATTTTT 54 80 
TGGATCCAGCTAGC ATAGCCGCT AGAGGTTGGGCAGCGC A 5520 
CAGAGCTAGGGCAAATGAAAGTGCAACAATCTTGATGACA 5560 
GCCACACCGCCTGGGACTAGTGATGAATTTCCACATTCAA 5600 
5610 5620 5630 5640 

ATGGTGAAATAGAAGATGTTCAAACGGACATACCCAGTG A 5640 
GCCCTGGAACAC AGGGCATGACTGGATCCTGGCTGAC AAA 5680 
AGGCCCACGGCATGGTTCCTTCCATCCATCAGAGCTGCAA 5720 
ATGTCATGGCTGCCTCTTTGCGTAAGGCTGGAAAGAGTGT 57 60 
GGTGGTCCTGAACAGGAAAACCTTTGAGAGAGAATACCCC 5800 
5810 5820 5830 5840 

ACGATAAAGCAGAAGAAACCTGACTTTATATTGGCCACTG 5840 
ACATAGCTGAAATGGGAGCCAACCTTTGCGTGGAGCGAGT 5880 
GCTGGATTGCAGGACGGCTTTTAAGCCTGTGCTTGTGGAT 5920 
GAAGGGAGGAAGGTGGCAATAAAAGGGCCACTTCGTATCT 5960 
CCGCATCCTCTGCTGCTCAAAGGAGGGGGCGCATTGGGAG 6000 
6010 6020 6030 6040 

AAATCCCAACAGAGATGGAGACTCATACTACTATTCTGAG 604 0 

CCTACAAGTGAAAATAATGCCCACCACGTCTGCTGGTTGG 6080 

AGGCCTCAATGCTCTTGGACAACATGGAGGTGAGGGGTGG 6120 

AATGGTCGCCCCACTCTATGGCGTTGAAGGAACTAAAAC A 6160 

CCAGTTTCCCCTGGTGAAATGAGACTGAGGGATGACCAGA 6200 



6210 



6220 



6230 



6240 



GGAAAGTCTTCAGAGAACTAGTGAGGAATTGTGACCTGCC 62 4 0 

CGTTTGGCTTTCGTGGC AAGTGGCCAAGGCTGGTTTGAAG 6280 

ACGAATGATCGTAAGTGGTGTTTTGAAGGCCCTGAGGAAC 6320 

ATGAGATCTTGAATGACAGCGGTGAAACAGTGAAGTGCAG 6360 

GGCTCCTGGAGGAGCAAAGAAGCCTCTGCGCCCAAGGTGG 6400 



6410 



6420 



6430 



6440 



TGTGATGAAAGGGTGTCATCTGACCAGAGTGCGCTGTCTG 64 4 0 
AATTTATTAAGTTTGCTGAAGGTAGGAGGGGAGCTGCTGA 64 80 
AGTGCTAGTTGTGCTGAGTGAACTCCCTGATTTCCTGGCT 6520 
AAAAAAGGTGGAGAGGCAATGGATACCATCAGTGTGTTCC 6560 
TCCACTCTGAGGAAGGCTCTAGGGCTTACCGCAATGCACT 6600 



6610 



6620 



6630 



6640 



ATCAATGATGCCTGAGGCAATGACAATAGTCATGCTGTTT 664 0 
ATACTGGCTGGACTACTGACATCGGGAATGGTCATCTTTT 6680 
TCATGTCTCCCAAAGGCATCAGTAGAAf GTCTATGGCGAT 6720 
GGGCACAATGGCCGGCTGTGGATATCTCATGTTCCTTGGA 6760 
GGCGTCAAACCCACTCACATCTCCTATGTCATGCTCATAT 6800 



6810 



6820 



6830 



6840 



TCTTTGTCCTGATGGTGGTTGTGATCCCCGAGCCAGGGC A 684 0 
ACAAAGGTCCATCC AAGAC AACCAAGTGGCATACCTCATT 6880 
ATTGGCATCCTGACGCTGGTTTCAGCGGTGGCAGCCAACG 6920 
AGCTAGGCATGCTGG AGAAAACC AAAGAGGACCTCTTTGG 6960 
GAAGAAGAACTTAATTCCATCTAGTGCTTCACCCTGGAGT 7000 
7010 7020 



7030 



7040 



TGGCCGGATCTTGACCTGAAGCCAGGAGCTGCCTGGACAG 704 0 
TGTACGTTGGCATTGTTACAATGCTCTCTCCAATGTTGCA 7 080 
CCACTGGATCAAAGTCGAATATGGCAACCTGTCTCTGTCT 7120 
GGAATAGCCCAGTCAGCCTCAGTCCTTTCTTTCATGGACA 7160 
AGGGGATACCATTCATGAAGATGAATATCTCGGTCATAAT 7200 
7210 7220 7230 7240 

GCTGCTGGTCAGTGGCTGGAATTCAATAACAGTGATGCCT 724 0 
CTGCTCTGTGGCATAGGGTGCGCCATGCTCCACTGGTCTC 7280 
TCATTTTACCTGGAATCAAAGCGCAGCAGTCAAAGCTTGC 7320 
ACAGAGAAGGGTGTTCCATGGCGTTGCCAAGAACCCTGTG 7360 
GTTGATGGGAATCCAACAGTTGACATTGAGGAAGCTCCTG 7400 

7410 7420 7430 7440 
1 ' 1 1 1 ' 1 ' ■ ' ' ■ ' ' ' ■ ■ ■ ■ ' ■ ■ ■ ■ ' ■ ■ • ■ ' • i ■ ■ I ■ ■ ■ ■ i 

AAATGCCTGCCCTTTATGAGAAGAAACTGGCTCTATATCT 74 40 
CCTTCTTGCTCTCAGCCTAGCTTCTGTTGCCATGTGCAGA 74 80 
ACGCCCTTTTCATTGGCTGAAGGCATTGTCCTAGCATCAG 7520 
CTGCCTTAGGGCCGCTCATAGAGGGAAACACCAGCCTTCT 7560 
TTGGAATGGACGCATGGCTGTCTCCATGACAGGAGTCATG 7 600 

7610 7620 7630 

' 1 1 1 1 ■ ■ • ■ 1 • ■ ■ ■ I ■ ■ ■ ■ I 



7640 



_L 



AGGGGGAATCACTATGCTTTTGTGGGAGTCATGTACAATC 7640 
TATGGAAGATGAAAACTGGACGCCGGGGGAGCGCGAATGG 7 680 
AAAAACTTTGGGTGAAGTCTGGAAGAGGGAACTGAATCTG 7720 
TTGGACAAGCGACAGTTTGAGTTGTATAAAAGGACCGACA 77 60 
TTGTGGAGGTGGATCGTGATACGGCACGCAGGCATTTGGC 7800 

7810 7820 7830 

1 1 1 1 1 1 ' 1 ' 1 ■ ' ■ ■ 1 ■ ■ ■ ■ 1 • ■ ■ • ' ■ ■ . , I i , 



7840 
■ ■ ■ ■ * 



CGAAGGGAAGGTGGACACCGGGGTGGCGGTCTCCAGGGGG 784 0 
ACCGCAAAGTTAAGGTGGTTCCATGAGCGTGGCTATGTC A 7 880 
AGCTGGAAGGTAGGGTGATTGACCTGGGGTGTGGCCGCGG 7 920 
AGGCTGGTGTTACTACGCTGCTGCGCAAAAGGAAGTGAGT 7 960 
GGGGTCAAAGGATTTACTCTTGGAAGAGACGGCCATGAGA 8000 
8010 8020 



8030 



8040 



AACCCATGAATGTGCAAAGTCTGGGATGG AACATCATC AC 804 0 
CTTCAAGGACAAAACTGATATCCACCGCCTAGAACCAGTG 8080 
AAATGTG ACACCCTTTTGTGTGACATTGGAGAGTCATCAT 8120 
CGTCATCGGTCACAGAGGGGGAAAGGACCGTGAGAGTTCT 8160 
TGATACTGTAGAAAAATGGCTGGCTTGTGGGGTTGACAAC 8200 
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8210 



8220 



8230 



8240 



TTCTGTGTGAAGGTGTTAGCTCCATACATGCCAGATGTTC 824 0 
TTGAGAAACTGGAATTGCTCCAAAGGAGGTTTGGCGGAAC 8280 
AGTGATCAGGAACCCTCTCTCCAGGAATTCCACTCATGAA 8320 
ATGTACTACGTGTCTGGAGCCCGCAGCAATGTCACATTTA 8360 
CTGTGAACCAAACATCCCGCCTCCTGATGAGGAGAATGAG 84 00 
8410 8420 



8430 



8440 



GCGTCCAACTGGAAAAGTGACCCTGGAGGCTGACGTCATC 844 0 
CTCCCAATTGGGACACGCAGTGTTGAGACAGACAAGGGAC 84 80 
CCCTGGACAAAGAGGCCATAGAAGAAAGGGTTGAGAGGAT 8520 
AAAATCTGAGTACATGACCTCTTGGTTTT ATGACAATGAC 8560 
AACCCCT ACAGGACCTGGCACT ACTGTGGCTCCTATGTCA 8 600 
8610 8620 8630 8640 

CAAAAACCTCCGGAAGTGCGGCGAGCATGGTAAATGGTGT 8 64 0 
TATTAAAATTCTGACATATCCATGGGACAGGATAGAGGAG 8680 
GTCACAAGAATGGCAATGACTGACACAACCCCTTTTGGAC 8720 
AGCAAAGAGTGTTTAAAGAAAAAGTTGACACCAGAGCAAA 8760 
GGATCCACCAGCGGGAACTAGGAAGATCATGAAAGTTGTC 8800 



8810 8820 8830 

1 1 1 1 * 1 1 1 ■ 1 ' 1 ■ ■ 1 ■ • ■ t l . ■ , , ' 



8840 
1 1 ■ I ' 1 ■ « 1 



AACAGGTGGCTGTTCCGCCACCTGGCCAGAGAAAAGAACC 884 0 
CCAGACTGTGCACAAAGGAAGAATTTATTGCAAAAGTCCG 8880 
AAGTCATGCAGCCATTGGAGCTTACCTGGAAGAACAAGAA 8 920 
CAGTGGAAGACTGCCAATGAGGCTGTCCAAGACCCAAAGT 8 960 
TCTGGGAACTGGTGGATGAAGAAAGGAAGCTGCACCAAC A 9000 



9010 9020 
1 1 ■ 1 ■ ■ ■ ■ 1 • ■ ■ • I ■ ■ ■ ■ I 



9030 



9040 



1 ■ ■ I ■ ■ ■ ■ 1 ■ ■ • ■ I ■ ■ ■ ■ i 



AGGCAGGTGTCGGACTTGTGTGTACAACATGATGGGGAAA 9040 

AGAGAGAAGAAGCTGTCAGAGTTTGGG AAAGCAAAGGGAA 9080 

GCCGTGCCAT ATGGT AT ATGTGGCTGGGAGCGCGGTATCT 9120 

TGAGTTTGAGGCCCTGGGATTCCTGAATGAGGACCATTGG 9160 

GCTTCCAGGGAAAACTC AGGAGGAGGAGTGGAAGGCATTG 9200 



9210 9220 
■ I ■ ■ ' ■ I ■ ■ ■ ■ I ■ ■ ■ ■ I ■ • ■ i | i 



9230 
■ . I ■ 



9240 



GCTTACAATACCTAGGATATGTGATCAGAGACCTGGCTGC 9240 
AATGGATGGTGGTGGATTCTACGCGGATGACACCGCTGGA 9280 
TGGGACACGCGCATCACAGAGGCAGACCTTGATGATGAAC 9320 
AGGAGATCTTGAACTACATGAGCCCACATCACAAAAAACT 9360 
GGCACAAGCAGTGATGGAAATGACATACAAGAACAAAGTG 9400 

9410 9420 9430 9440 
■ J ' 1 1 1 1 1 "In" I ' ' ' ■ I .... I ■ ■■■ i ■ ■■■ i ■ | | , | 



GTGAAAGTGTTGAGACCAGCCCCAGGAGGGAAAGCCTACA 94 4 0 

TGGATGTCATAAGTCGACGAGACCAGAGAGGATCCGC-GCA 94 80 

GGTAGTGACTTATGCTCTGAACACCATCACCAACTTGAAA 9520 

GTCCAATTGATCAGAATGGCAGAAGCAGAGATGGTGATAC 9560 

ATCACCAACATGTTCAAGATTGTGATGAATCAGTTCTGAC 9600 
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CAGGCTGGAGGCATGGCTCACTGAGCACGGATGTGACAGA 9640 
CTGAAGAGG ATGGCGGTGAGTGGAGACG ACTGTGTGGTCC 9680 
GGCCCATCGATGACAGGTTCGGCCTGGCCCTGTCCCATCT 9720 
CAACGCCATGTCCAAGGTTAGAAAGGACATATCTGAATGG 97 60 
CAGCCATCAAAAGGGTGGAATGATTGGGAGAATGTGCCCT 9800 
9810 9820 9830 9840 

■ ... I 



■ ■ ■ 1 ■ ■ ■ 



TCTGTTCCCACCACTTCCATGAACTACAGCTGAAGGATGG 9840 
CAGGAGGATTGTGGTGCCTTGCCGAGAACAGGACGAGCTC 98 80 
ATTGGGAGAGGAAGGGTGTCTCCAGGAAACGGCTGGATG A 9920 
TCAAGGAAACAGCTTGCCTCAGCAAAGCCTATGCCAACAT 9960 
GTGGTCACTGATGTATTTTC ACAAAAGGGACATGAGGCTA 10000 
10010 10020 10030 10040 

CTGTCATTGGCTGTTTCCTCAGCTGTTCCCACCTCATGGG 10040 
TTCCACAAGGACGCACAACATGGTCGATTCATGGGAAAGG 10080 
GGAGTGGATGACCACGGAAGACATGCTTGAGGTGTGGAAC 10120 
AGAGTATGGATAACCAACAACCCACACATGCAGGACAAGA 10160 
CAATGGTGAAAAAATGGAGAGATGTCCCTT ATCTAACCAA 10200 
10210 10220 10230 10240 

GAGACAAGACAAGCTGTGCGGATCACTGATTGGAATGACC 10240 
AATAGGGCCACCTGGGCCTCCCACATCCATTTAGTCATCC 10280 
ATCGTATCCGAACGCTGATTGGACAGGAGAAATACACTGA 10320 
CTACCTAACAGTCATGGACAGGTATTCTGTGGATGCTGAC 10360 
CTGCAACTGGGTGAGCTT ATCTGAAACACCATCTAACAGG 10400 
10410 10420 10430 10440 

AATAACCGGGATACAAACCACGGGTGGAGAACCGGACTCC 104 40 
CCACAACCTGAAACCGGGATAT AAACCACGGCTGG AG AAC 10480 
CGGGCTCCGCACTTAAAATGAAAC AG AAACCGGGATAAAA 10520 
ACTACGGATGGAGAACCGGACTCCACACATTGAGACAGAA 10560 
GAAGTTGTCAGCCCAGAACCCCACACGAGTTTTGCCACTG 10600 
10610 10620 10630 10640 



1 11 i i i i I i i i i I i i i i I i i t i I ., ■. i ... i i .... i 

CTAAGCTGTGAGGCAGTGCAGGCTGGGACAGCCGACCTCC 10640 

AGGTTGCGAAAAACCTGGTTTCTGGGACCTCCCACCCCAG 10680 

AGTAAAAAGAACGGAGCCTCCGCT ACCACCCTCCC ACGTG 10720 

GTGGTAGAAAGACGGGGTCTAGAGGTTAGAGGAGACCCTC 107 60 

CAGGGAACAAATAGTGGGACCATATTGACGCCAGGGAAAG 10800 
10810 10820 

ACCGGAGTGGTTCTCTGCTTTTCCTCCAGAGGTCTGTGAG 1084 0 

CACAGTTTGCTCAAGAATAAGCAGACCTTTGGATGACAAA 10880 
CACAAAACCACT 108 92 
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10 20 30 40 

■ 1 ■ 1 I ■ ■ ■ 1 1 1 ■ ■ ■ 1 ■ ■ ■ ■ 1 ■ ■ ■ ■ I ■ ■ ■ ■ ' ■ ■ ■ ■ I ■ • ■ • i 

MSGRKAQGKTLGVNMVRRGVRSLSNKIKQKTKQIGNRPGP 4 0 
SRGVQGFIFFFLFNILTGKKITAHLKRLWKMLDPRQGLAV 8 0 
LRKVKR VVASLMRGL S SRKRRS H D VLT VQFL I LGMLLMTG 120 
GMKLSNFQGKLLMTINNTDIADVI VI PTSKGENRCWVRAI 160 
DVGYMCEDTITYECPKLTMGNDPEDVDCWCDNQEVYVQYG 200 

210 220 230 240 

■ 1 ■ 1 I 1 1 1 1 1 ' ■ ' 1 1 1 ■ ■ » 1 ■ ■ ■ ■ 1 ■ ■ • ■ ' ■ • ■ ■ I ■ ■ • ■ I 

RCTRTRHSKRSRRSVSVQTHGESSLVNKKEAWLDSTKATR 24 0 
YLMKTENWIIRNPGYAFLAAVLGWMLGSNNGQRVVFTILL 280 
LLVAPAYSFNCLGMGNRDFIEGASGATWVDLVLEGDSCLT 320 
IMANDKPTLDVRMINIEASQLAEVRS YCYHASVTDISTVA 360 
RCPTTGEAHNEKRADSSYVCKQGFTDRGWGNGCGFFGKGS 4 00 

410 420 430 440 
'■''*'■■'*«■■■ 1 1 I ■■ ■■ I ■ ■■■ I 

IDTCAKFSCTSKAIGRTIQPENIKYKVGIFVHGTTTSENH 440 
GNYSAQVGASQAAKFTVTPNAPSVALKLGDYGEVTLDCEP 4 80 
RSGLNTEAFYVMTVGSKSFLVHREWFHDLALPWTSPSSTA 520 
WRNRELLMEFEGAHATKQSVVALGSQEGGLHHALAGAI VV 560 
E YS S S VMLTSGHLKCRLKMDKLALKGTT YGMCTEKFS FAK 600 

610 620 630 640 
' 1 1 ' I 1 1 ■ 1 1 1 ■ ■ 1 1 ■ ■ ■ « 1 « ■ ■ • 1 ■ ■ ■ ■ I ■ ■ • ■ I ■ ■ ■ • I 

NPVDTGHGTVVIELSYSGSDGPCKIPIVSVASLNDMTPVG 640 
RL VT VN P FVAT S S AN SKVLVEME P PFGDSYI VVGRG DKQ I 680 
NHHWHKAGSTLGKAFSTTLKGAQRLAALGDTAWDFGSIGG 720 
VFNSIGRAVHQVFGGAFRTLFGGMSWITQGLMGALLLWMG 7 60 
VNARDRSIALAFLATGGVLVFLATNVGADQGCAINFGKRE 8 00 

810 820 830 840 

■ 1 ' ' I ' ■ ' ■ I ■ ■ < ' ' ■ ■ ■ ■ ' ■ ■ ■ ■ ' ■ ■ ■ ■ I ■ ■ ■ ■ I ■ ■ ■ . I 

LKCGDGIFI FRDSDDWLNKYSYYPEDPVKLASI VKASFEE 8 4 0 
GKCGLNSVDSLEHEMWRSRADEINAIFEENEVDISVVVQD 8 80 
PKNVYQRGTH P FS RI RDGLQYGWKTWGKNLVFS PGRKNGS 920 
FIIDGKSRKECPFSNRVWNSFQIEEFGTGVFTTRVYMDAV 9 60 
FEYTIDCDGSILGAAVNGKKSAHGSPTFWMGSHEVNGTWM 1000 

1010 1020 1030 1040 
■' ' ' 1 I 1 1 1 1 * ■ 1 ■ ■ 1 1 ■ ■ ■ 1 ■ ■ ■ ■ 1 ■ ■ ■ ■ I ■ ■ ■ ' I ■ ■ ■ ■ I 

IHTLEALDYKECEWPLTHTIGTSVEESEMFMPRSIGGPVS 104 0 
SHNHI PGYKVQTNGPWMQVPLEVKREACPGTSVI I DGNCD 1080 
GRGKSTRSTTDSGKVIPEWCCRSCTMPPVSFHGSDGCWY? 1120 
MEIRPRKTHESHLVRSWVTAGEIHAVPFGLVSMMIAMEVV 1160 
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1210 1220 1230 1240 

i i i i I i i i i I i i i i I i i i i I i i i i I ■ i ■■ I i t I I 

HFHEMNNGGDAMYMALIAAFSIRPGLLIGFGLRTLWSPRE 124 0 
RLVLTLGAAMVEIALGGVMGGLWKYLNAVSLCILTINAVA 1280 
SRKASNTILPLMALLTPVTMAEVRLAAMFFCAMVIIGVLH 1320 
QNFKDTSMQKTIPLVALTLTSYLGLTQPFLGLCAFLATRI 13 60 
FGRRSIPVNEALAAAGLVGVLAGLAFQEMENFLGPIAVGG 14 00 

1410 1420 1430 1440 
i i i i I i i i i I i i i i I i i i i I i ■ i i I i ■■ t I i i t i I .... I 

LLMMLVSVAGRVDGLELKKLGEVSWEEEAEISGSSARYDV 14 4 0 
ALSEQGEFKLLSEEKVPWDQVVMTSLALVGAALHPFALLL 14 80 
VLAGWLFHVRGARRSGDVLWDIPTPKIIEECEHLEDGIYG 1520 
I FQSTFLGASQRGVGVAQGGVFHTMWHVTRGAFLVRNGKK 1560 
LIPSWASVKEDLVAYGGSWKLEGRWDGEEEVQLIAAVPGK 1600 

1610 1620 1630 1640 
1 1 ■ ■ I ■ 1 * * * ' ■ * ' 1 * « ■ ■ I * ■ ■ ■ I ■ « ■ i * » ■ ■ ■ I * » ■ ■ 1 

NVVNVQTKPSLFKVRNGGEIGAVALDYPSGTSGSPIVNRN 164 0 

GEVIGLYGNGILVGDNSFVSAISQTEVKEEGKEELQEIPT 1680 

MLKKGMTTVLDFHPGAGKTRRFLPQILAECARRRLRTLVL 1720 

APTRVVLSEMKEAFHGLDVKFHTQAFSAHGSGREVIDAMC 17 60 

HATLTYRMLEPTRVVNWEVI IMDEAHFLDPAS I AARGWAA 1800 

1810 1820 1830 1840 
1 ■ 1 ■ ^ > * 1 » I * ■ ■ » I * * ■ ■ I ■ ■ ■ ■ I » » ■ t I ■ * * t I t . . . I 

HRARANESATILMTATPPGTSDEFPHSNGEIEDVQTDIPS 184 0 

EPWNTGHDWILADKRPTAWFLPSIRAANVMAASLRKAGKS 1880 

WVLNRKTFEREYPTIKQKKPDFILATDIAEMGANLCVER 1920 

VLDCRTAFKPVLVDEGRKVAIKGPLRISASSAAQRRGRIG 1960 

RNPNRDGDSYYYSEPTSENNAHHVCWLEASMLLDNMEVRG 2000 

2010 2020 2030 2040 
' ' »' I i i i i I ti.t I i i i i I ■ ... I .... I .... I ... . I 

GMVAPLYGVEGTKTPVSPGEMRLRDDQRKVFRELVRNCDL 204 0 

PVWLSWQVAKAGLKTNDRKWCFEGPEEHEILNDSGETVKC 2080 

RAPGGAKKPLRPRWCDERVSSDQSALSE FIKFAEGRRGAA 2120 

EVLVVLSELPDFLAKKGGEAMDTISVFLHSEEGSRAYRNA 2160 

LSMMPEAMTIVMLFILAGLLTSGMVIFFMSPKGISRMSMA 2200 

2210 2220 2230 2240 
' ' ' ' I i i i ■ 1 i i . . I .... 1 .... I ■ , , . 1 , , . . I , , . . I 

MGTMAGCG YLMFLGGVKPTH I S YVMLI FFVLMVVVI PE PG 224 0 

QQRSIQDNQVAYLIIGILTLVSAVAANELGMLEKTKEDLF 2280 

GKKNLIPSSASPWSWPDLDLKPGAAWTVYVGIVTMLSPML 2320 

HHWIKVEYGNLSLSGIAQSASVLSFMDKGIPFMKMNISVI 23 60 

MLLVSGWNSITVMPLLCGIGCAMLHWSLILPGIKAQQSKL 24 00 

2410 2420 2430 2440 
' ' 'I t i ■ ■ I t t . t 1 . . i i I . . . i I . . . . I . . . . 1 . . . . I 

AQRRVFHGVAKNPVVDGNPTVDIEEAPEMPALYEKKLALY 24 4 0 

LLLALSLASVAMCRTPFSLAEGIVLASAALGPLIEGNTSL 2 4 80 

LWNGPMAVSMTGVMRGNHYAFVGVMYNLWKMKTGRRGSAN 2520 

GKTLGEVWKRELNLLDKRQFELYKRTDIVEVDRDTARRHL 2 5 60 

AEGKVDTGVAVSRGTAKLRWFHERGYVKLEGRVIDLGCGR 2 600 
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2610 2620 2630 2640 

* * ■ » I 1 1 * ■ I ■ » ■ > 1 » ' » * 1 ■ ■ ■ > I ■ ■ ■ * 1 ■ ■ ■ ■ I ■ « ■ ■ I 

GGWC YYAAAQKEVSGVKGFTLGRDGHEKPMNVQSLGWNI I 264 0 
TFKDKTDIHRLEPVKCDTLLCDIGESSSSSVTEGERTVRV 2680 
LDTVEKWLACGVDNFCVKVLAPYMPDVLEKLELLQRRFGG 2720 
TVIRNPLSRNSTHEMYYVSGARSNVTFTVNQTSRLLMRRM 27 60 
RRPTGKVTLEADVILPIGTRSVETDKGPLDKEAIEERVER 2800 

2810 2820 2830 2840 

* 1 ■ 1 I » ■ * ■ I ■ * 1 » 1 ■ 1 1 * I ■ * » * I ■ ■ « « I ■ « « ■ I ■ « « ■ I 

IKSEYMTSWFYDNDNPYRTWHYCGSYVTKTSGSAASMVNG 284 0 

VIKILTYPWDRIEEVTRMAMTDTTPFGQQRVFKEKVDTRA 2880 

KDPPAGTRKIMKVVNRWLFRHLAREKNPRLCTKEEFIAKV 2 920 

RSHAAIGAYLEEQEQWKTANEAVQDPKFWELVDEERKLHQ 2 960 

QGRCRTCVYNMMGKREKKLSEFGKAKGSRAIWYMWLGARY 3000 

3010 3020 3030 3040 

1 1 1 ' I ' 1 1 1 1 1 1 1 1 I ' 1 ■ 1 I ' ■ ' ' I ' ■ ' ■ I ' « ' » I ■ ■ » ' I 

LEFEALGFLNEDHWASRENSGGGVEGIGLQYLGYVI'RDLA 304 0 
AMDGGGFYADDTAGWDTRITEADLDDEQEILNYMSPHHKK 3080 
LAQAVMEMTYKNKVVKVLRPAPGGKAYMDVISRRDQRGSG 3120 
QVVTYALNTITNLKVQLIRMAEAEMVIHHQHVQDCDESVL 3160 
TRLEAWLTEHGCDRLKRMAVSGDDCVVRPIDDRFGLALSH 3200 

3210 3220 3230 3240 

I I I I I I I I I 1 ■ ■ t I I I I I I I ■ ■ I I I ■ I . | I , ! ! | | , , , , | 

LNAMSKVRKDISEWQPSKGWNDWENVPFCSHHFHELQLKD 324 0 
GRRIVVPCREQDELIGRGRVSPGNGWMIKETACLSKAYAN 3280 
MWSLMYFHKRDMRLLSLAVSSAVPTSWVPQGRTTWSIHGK 3320 
GEWMTTEDMLEVWNRVWITNNPHMQDKTMVKKWRDVPYLT 3360 
KRQDKLCGSLIGMTNRATWASHIHLVIHRIRTLIGQEKYT 34 00 

3410 3420 3430 3440 
i i i t I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I 

DYLTVMDRYSVDADLQLGELI . 3422 



